[cs615asa] HW5 & HW6

Tianpei Luo tluo4 at stevens.edu
Fri Mar 31 15:21:30 EDT 2017


What about the longest word in the retrieved Pages?Is that means find out
the longest words that only showing on browser?(not include the script,
tags, metadata)

On Fri, Mar 31, 2017 at 3:11 PM, Tianpei Luo <tluo4 at stevens.edu> wrote:

> I think it should be the domain+page_title.
>
> On Thu, Mar 30, 2017 at 3:22 PM, Tianxiao Yang <tyang8 at stevens.edu> wrote:
>
>> > How many unique objects were requested?
>> What is the object in assignment-5?
>>
>> On Thu, Mar 30, 2017 at 10:55 AM, Tianpei Luo <tluo4 at stevens.edu> wrote:
>>
>>> I thought the problem is to calculate the unique objects which means the
>>> same title in different domains should be count once. Or it should be twice.
>>>
>>> On Thu, Mar 30, 2017 at 10:29 AM, Jan Schaumann <jschauma at stevens.edu>
>>> wrote:
>>>
>>>> Tianpei Luo <tluo4 at stevens.edu> wrote:
>>>>
>>>> > I have some problem with HW5. When I want to sort the input file to
>>>> > get the unique objects for the first question, it show no space for
>>>> > sorting(only 400mb space left for micro type) Is there any good choice
>>>> > for this question or just using m1.small for a large space?
>>>>
>>>> (Accidental complexity)
>>>> Do you need to sort?  If so, do you need to sort the whole file?
>>>>
>>>> (Inherent complexity)
>>>> If you need to sort, how are you trying to do that?
>>>>
>>>> -Jan
>>>> _______________________________________________
>>>> cs615asa mailing list
>>>> cs615asa at lists.stevens.edu
>>>> https://lists.stevens.edu/mailman/listinfo/cs615asa
>>>>
>>>
>>>
>>> _______________________________________________
>>> cs615asa mailing list
>>> cs615asa at lists.stevens.edu
>>> https://lists.stevens.edu/mailman/listinfo/cs615asa
>>>
>>>
>>
>> _______________________________________________
>> cs615asa mailing list
>> cs615asa at lists.stevens.edu
>> https://lists.stevens.edu/mailman/listinfo/cs615asa
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.stevens.edu/pipermail/cs615asa/attachments/20170331/4487b6cc/attachment.html>


More information about the cs615asa mailing list